|

1.

The language network as a natural kind within the broader landscape of the human brain.

Fedorenko, Evelina; Ivanova, Anna A; Regev, Tamar I.

Nat Rev Neurosci ; 25(5): 289-312, 2024 May.

Article En | MEDLINE | ID: mdl-38609551

Language behaviour is complex, but neuroscientific evidence disentangles it into distinct components supported by dedicated brain areas or networks. In this Review, we describe the 'core' language network, which includes left-hemisphere frontal and temporal areas, and show that it is strongly interconnected, independent of input and output modalities, causally important for language and language-selective. We discuss evidence that this language network plausibly stores language knowledge and supports core linguistic computations related to accessing words and constructions from memory and combining them to interpret (decode) or generate (encode) linguistic messages. We emphasize that the language network works closely with, but is distinct from, both lower-level - perceptual and motor - mechanisms and higher-level systems of knowledge and reasoning. The perceptual and motor mechanisms process linguistic signals, but, in contrast to the language network, are sensitive only to these signals' surface properties, not their meanings; the systems of knowledge and reasoning (such as the system that supports social reasoning) are sometimes engaged during language use but are not language-selective. This Review lays a foundation both for in-depth investigations of these different components of the language processing pipeline and for probing inter-component interactions.

Brain , Language , Humans , Brain/physiology , Nerve Net/physiology , Neural Pathways/physiology , Brain Mapping

2.

Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training.

Hosseini, Eghbal A; Schrimpf, Martin; Zhang, Yian; Bowman, Samuel; Zaslavsky, Noga; Fedorenko, Evelina.

Neurobiol Lang (Camb) ; 5(1): 43-63, 2024.

Article En | MEDLINE | ID: mdl-38645622

Artificial neural networks have emerged as computationally plausible models of human language processing. A major criticism of these models is that the amount of training data they receive far exceeds that of humans during language learning. Here, we use two complementary approaches to ask how the models' ability to capture human fMRI responses to sentences is affected by the amount of training data. First, we evaluate GPT-2 models trained on 1 million, 10 million, 100 million, or 1 billion words against an fMRI benchmark. We consider the 100-million-word model to be developmentally plausible in terms of the amount of training data given that this amount is similar to what children are estimated to be exposed to during the first 10 years of life. Second, we test the performance of a GPT-2 model trained on a 9-billion-token dataset to reach state-of-the-art next-word prediction performance on the human benchmark at different stages during training. Across both approaches, we find that (i) the models trained on a developmentally plausible amount of data already achieve near-maximal performance in capturing fMRI responses to sentences. Further, (ii) lower perplexity-a measure of next-word prediction performance-is associated with stronger alignment with human data, suggesting that models that have received enough training to achieve sufficiently high next-word prediction performance also acquire representations of sentences that are predictive of human fMRI responses. In tandem, these findings establish that although some training is necessary for the models' predictive ability, a developmentally realistic amount of training (â¼100 million words) may suffice.

3.

Lexical-Semantic Content, Not Syntactic Structure, Is the Main Contributor to ANN-Brain Similarity of fMRI Responses in the Language Network.

Kauf, Carina; Tuckute, Greta; Levy, Roger; Andreas, Jacob; Fedorenko, Evelina.

Neurobiol Lang (Camb) ; 5(1): 7-42, 2024.

Article En | MEDLINE | ID: mdl-38645614

Representations from artificial neural network (ANN) language models have been shown to predict human brain activity in the language network. To understand what aspects of linguistic stimuli contribute to ANN-to-brain similarity, we used an fMRI data set of responses to n = 627 naturalistic English sentences (Pereira et al., 2018) and systematically manipulated the stimuli for which ANN representations were extracted. In particular, we (i) perturbed sentences' word order, (ii) removed different subsets of words, or (iii) replaced sentences with other sentences of varying semantic similarity. We found that the lexical-semantic content of the sentence (largely carried by content words) rather than the sentence's syntactic form (conveyed via word order or function words) is primarily responsible for the ANN-to-brain similarity. In follow-up analyses, we found that perturbation manipulations that adversely affect brain predictivity also lead to more divergent representations in the ANN's embedding space and decrease the ANN's ability to predict upcoming tokens in those stimuli. Further, results are robust as to whether the mapping model is trained on intact or perturbed stimuli and whether the ANN sentence representations are conditioned on the same linguistic context that humans saw. The critical result-that lexical-semantic content is the main contributor to the similarity between ANN representations and neural ones-aligns with the idea that the goal of the human language system is to extract meaning from linguistic strings. Finally, this work highlights the strength of systematic experimental manipulations for evaluating how close we are to accurate and generalizable models of the human language network.

4.

Language in Brains, Minds, and Machines.

Tuckute, Greta; Kanwisher, Nancy; Fedorenko, Evelina.

Annu Rev Neurosci ; 2024 Apr 26.

Article En | MEDLINE | ID: mdl-38669478

It has long been argued that only humans could produce and understand language. But now, for the first time, artificial language models (LMs) achieve this feat. Here we survey the new purchase LMs are providing on the question of how language is implemented in the brain. We discuss why, a priori, LMs might be expected to share similarities with the human language system. We then summarize evidence that LMs represent linguistic information similarly enough to humans to enable relatively accurate brain encoding and decoding during language processing. Finally, we examine which LM properties-their architecture, task performance, or training-are critical for capturing human neural responses to language and review studies using LMs as in silico model organisms for testing hypotheses about language. These ongoing investigations bring us closer to understanding the representations and processes that underlie our ability to comprehend sentences and express thoughts in language.

5.

Cognitive Computational Neuroscience of Language: Using Computational Models to Investigate Language Processing in the Brain.

Lopopolo, Alessandro; Fedorenko, Evelina; Levy, Roger; Rabovsky, Milena.

Neurobiol Lang (Camb) ; 5(1): 1-6, 2024.

Article En | MEDLINE | ID: mdl-38645621

6.

Distributed Sensitivity to Syntax and Semantics throughout the Language Network.

Shain, Cory; Kean, Hope; Casto, Colton; Lipkin, Benjamin; Affourtit, Josef; Siegelman, Matthew; Mollica, Francis; Fedorenko, Evelina.

J Cogn Neurosci ; : 1-43, 2024 Apr 22.

Article En | MEDLINE | ID: mdl-38683732

Human language is expressive because it is compositional: the meaning of a sentence (semantics) can be inferred from its structure (syntax). It is commonly believed that language syntax and semantics are processed by distinct brain regions. Here we revisit this claim using precision fMRI methods to capture separation or overlap of function in the brains of individual participants. Contrary to prior claims, we find distributed sensitivity to both syntax and semantics throughout a broad frontotemporal brain network. Our results join a growing body of evidence for an integrated network for language in the human brain within which internal specialization is primarily a matter of degree rather than kind, in contrast with influential proposals that advocate distinct specialization of different brain areas for different types of linguistic functions.

7.

Dissociating language and thought in large language models.

Mahowald, Kyle; Ivanova, Anna A; Blank, Idan A; Kanwisher, Nancy; Tenenbaum, Joshua B; Fedorenko, Evelina.

Trends Cogn Sci ; 2024 Mar 19.

Article En | MEDLINE | ID: mdl-38508911

Large language models (LLMs) have come closest among all models to date to mastering human language, yet opinions about their linguistic and cognitive capabilities remain split. Here, we evaluate LLMs using a distinction between formal linguistic competence (knowledge of linguistic rules and patterns) and functional linguistic competence (understanding and using language in the world). We ground this distinction in human neuroscience, which has shown that formal and functional competence rely on different neural mechanisms. Although LLMs are surprisingly good at formal competence, their performance on functional competence tasks remains spotty and often requires specialized fine-tuning and/or coupling with external modules. We posit that models that use language in human-like ways would need to master both of these competence types, which, in turn, could require the emergence of separate mechanisms specialized for formal versus functional linguistic competence.

8.

High-level language brain regions process sublexical regularities.

Regev, Tamar I; Kim, Hee So; Chen, Xuanyi; Affourtit, Josef; Schipper, Abigail E; Bergen, Leon; Mahowald, Kyle; Fedorenko, Evelina.

Cereb Cortex ; 34(3)2024 03 01.

Article En | MEDLINE | ID: mdl-38494886

A network of left frontal and temporal brain regions supports language processing. This "core" language network stores our knowledge of words and constructions as well as constraints on how those combine to form sentences. However, our linguistic knowledge additionally includes information about phonemes and how they combine to form phonemic clusters, syllables, and words. Are phoneme combinatorics also represented in these language regions? Across five functional magnetic resonance imaging experiments, we investigated the sensitivity of high-level language processing brain regions to sublexical linguistic regularities by examining responses to diverse nonwords-sequences of phonemes that do not constitute real words (e.g. punes, silory, flope). We establish robust responses in the language network to visually (experiment 1a, n = 605) and auditorily (experiments 1b, n = 12, and 1c, n = 13) presented nonwords. In experiment 2 (n = 16), we find stronger responses to nonwords that are more well-formed, i.e. obey the phoneme-combinatorial constraints of English. Finally, in experiment 3 (n = 14), we provide suggestive evidence that the responses in experiments 1 and 2 are not due to the activation of real words that share some phonology with the nonwords. The results suggest that sublexical regularities are stored and processed within the same fronto-temporal network that supports lexical and syntactic processes.

Brain Mapping , Language , Brain Mapping/methods , India , Brain/diagnostic imaging , Brain/physiology , Linguistics , Magnetic Resonance Imaging

9.

Functional characterization of the language network of polyglots and hyperpolyglots with precision fMRI.

Malik-Moraleda, Saima; Jouravlev, Olessia; Taliaferro, Maya; Mineroff, Zachary; Cucu, Theodore; Mahowald, Kyle; Blank, Idan A; Fedorenko, Evelina.

Cereb Cortex ; 34(3)2024 03 01.

Article En | MEDLINE | ID: mdl-38466812

How do polyglots-individuals who speak five or more languages-process their languages, and what can this population tell us about the language system? Using fMRI, we identified the language network in each of 34 polyglots (including 16 hyperpolyglots with knowledge of 10+ languages) and examined its response to the native language, non-native languages of varying proficiency, and unfamiliar languages. All language conditions engaged all areas of the language network relative to a control condition. Languages that participants rated as higher proficiency elicited stronger responses, except for the native language, which elicited a similar or lower response than a non-native language of similar proficiency. Furthermore, unfamiliar languages that were typologically related to the participants' high-to-moderate-proficiency languages elicited a stronger response than unfamiliar unrelated languages. The results suggest that the language network's response magnitude scales with the degree of engagement of linguistic computations (e.g. related to lexical access and syntactic-structure building). We also replicated a prior finding of weaker responses to native language in polyglots than non-polyglot bilinguals. These results contribute to our understanding of how multiple languages coexist within a single brain and provide new evidence that the language network responds more strongly to stimuli that more fully engage linguistic computations.

Multilingualism , Humans , Magnetic Resonance Imaging , Language , Brain/diagnostic imaging , Brain/physiology , Brain Mapping

10.

Let's move forward: Image-computable models and a common model evaluation scheme are prerequisites for a scientific understanding of human vision - CORRIGENDUM.

DiCarlo, James J; Yamins, Daniel L K; Ferguson, Michael E; Fedorenko, Evelina; Bethge, Matthias; Bonnen, Tyler; Schrimpf, Martin.

Behav Brain Sci ; 47: e66, 2024 Feb 02.

Article En | MEDLINE | ID: mdl-38305315

11.

Driving and suppressing the human language network using large language models.

Tuckute, Greta; Sathe, Aalok; Srikant, Shashank; Taliaferro, Maya; Wang, Mingye; Schrimpf, Martin; Kay, Kendrick; Fedorenko, Evelina.

Nat Hum Behav ; 8(3): 544-561, 2024 Mar.

Article En | MEDLINE | ID: mdl-38172630

Transformer models such as GPT generate human-like language and are predictive of human brain responses to language. Here, using functional-MRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict the magnitude of the brain response associated with each sentence. We then use the model to identify new sentences that are predicted to drive or suppress responses in the human language network. We show that these model-selected novel sentences indeed strongly drive and suppress the activity of human language areas in new individuals. A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of neural network models to not only mimic human language but also non-invasively control neural activity in higher-level cortical areas, such as the language network.

Comprehension , Language , Humans , Comprehension/physiology , Brain/diagnostic imaging , Brain/physiology , Linguistics/methods , Brain Mapping/methods

12.

Functional characterization of the language network of polyglots and hyperpolyglots with precision fMRI.

Malik-Moraleda, Saima; Jouravlev, Olessia; Taliaferro, Maya; Mineroff, Zachary; Cucu, Theodore; Mahowald, Kyle; Blank, Idan A; Fedorenko, Evelina.

bioRxiv ; 2024 Jan 30.

Article En | MEDLINE | ID: mdl-36711949

How do polyglots-individuals who speak five or more languages-process their languages, and what can this population tell us about the language system? Using fMRI, we identified the language network in each of 34 polyglots (including 16 hyperpolyglots with knowledge of 10+ languages) and examined its response to the native language, non-native languages of varying proficiency, and unfamiliar languages. All language conditions engaged all areas of the language network relative to a control condition. Languages that participants rated as higher-proficiency elicited stronger responses, except for the native language, which elicited a similar or lower response than a non-native language of similar proficiency. Furthermore, unfamiliar languages that were typologically related to the participants' high-to-moderate-proficiency languages elicited a stronger response than unfamiliar unrelated languages. The results suggest that the language network's response magnitude scales with the degree of engagement of linguistic computations (e.g., related to lexical access and syntactic-structure building). We also replicated a prior finding of weaker responses to native language in polyglots than non-polyglot bilinguals. These results contribute to our understanding of how multiple languages co-exist within a single brain and provide new evidence that the language network responds more strongly to stimuli that more fully engage linguistic computations.

13.

Let's move forward: Image-computable models and a common model evaluation scheme are prerequisites for a scientific understanding of human vision.

DiCarlo, James J; Yamins, Daniel L K; Ferguson, Michael E; Fedorenko, Evelina; Bethge, Matthias; Bonnen, Tyler; Schrimpf, Martin.

Behav Brain Sci ; 46: e390, 2023 Dec 06.

Article En | MEDLINE | ID: mdl-38054303

In the target article, Bowers et al. dispute deep artificial neural network (ANN) models as the currently leading models of human vision without producing alternatives. They eschew the use of public benchmarking platforms to compare vision models with the brain and behavior, and they advocate for a fragmented, phenomenon-specific modeling approach. These are unconstructive to scientific progress. We outline how the Brain-Score community is moving forward to add new model-to-human comparisons to its community-transparent suite of benchmarks.

Brain , Neural Networks, Computer , Humans

14.

Event Knowledge in Large Language Models: The Gap Between the Impossible and the Unlikely.

Kauf, Carina; Ivanova, Anna A; Rambelli, Giulia; Chersoni, Emmanuele; She, Jingyuan Selena; Chowdhury, Zawad; Fedorenko, Evelina; Lenci, Alessandro.

Cogn Sci ; 47(11): e13386, 2023 Nov.

Article En | MEDLINE | ID: mdl-38009752

Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs' semantic abilities is whether they acquire generalized knowledge of common events. Here, we test whether five pretrained LLMs (from 2018's BERT to 2023's MPT) assign a higher likelihood to plausible descriptions of agent-patient interactions than to minimally different implausible versions of the same event. Using three curated sets of minimal sentence pairs (total n = 1215), we found that pretrained LLMs possess substantial event knowledge, outperforming other distributional language models. In particular, they almost always assign a higher likelihood to possible versus impossible events (The teacher bought the laptop vs. The laptop bought the teacher). However, LLMs show less consistent preferences for likely versus unlikely events (The nanny tutored the boy vs. The boy tutored the nanny). In follow-up analyses, we show that (i) LLM scores are driven by both plausibility and surface-level sentence features, (ii) LLM scores generalize well across syntactic variants (active vs. passive constructions) but less well across semantic variants (synonymous sentences), (iii) some LLM errors mirror human judgment ambiguity, and (iv) sentence plausibility serves as an organizing dimension in internal LLM representations. Overall, our results show that important aspects of event knowledge naturally emerge from distributional linguistic patterns, but also highlight a gap between representations of possible/impossible and likely/unlikely events.

Language , Semantics , Male , Humans , Knowledge , Reading , Judgment

15.

Joint cortical registration of geometry and function using semi-supervised learning.

Li, Jian; Tuckute, Greta; Fedorenko, Evelina; Edlow, Brian L; Fischl, Bruce; Dalca, Adrian V.

ArXiv ; 2023 Oct 16.

Article En | MEDLINE | ID: mdl-37744470

Brain surface-based image registration, an important component of brain image analysis, establishes spatial correspondence between cortical surfaces. Existing iterative and learning-based approaches focus on accurate registration of folding patterns of the cerebral cortex, and assume that geometry predicts function and thus functional areas will also be well aligned. However, structure/functional variability of anatomically corresponding areas across subjects has been widely reported. In this work, we introduce a learning-based cortical registration framework, JOSA, which jointly aligns folding patterns and functional maps while simultaneously learning an optimal atlas. We demonstrate that JOSA can substantially improve registration performance in both anatomical and functional domains over existing methods. By employing a semi-supervised training strategy, the proposed framework obviates the need for functional data during inference, enabling its use in broad neuroscientific domains where functional data may not be observed. The source code of JOSA will be released to the public at https://voxelmorph.net.

16.

Grammatical cues to subjecthood are redundant in a majority of simple clauses across languages.

Mahowald, Kyle; Diachek, Evgeniia; Gibson, Edward; Fedorenko, Evelina; Futrell, Richard.

Cognition ; 241: 105543, 2023 Dec.

Article En | MEDLINE | ID: mdl-37713956

Grammatical cues are sometimes redundant with word meanings in natural language. For instance, English word order rules constrain the word order of a sentence like "The dog chewed the bone" even though the status of "dog" as subject and "bone" as object can be inferred from world knowledge and plausibility. Quantifying how often this redundancy occurs, and how the level of redundancy varies across typologically diverse languages, can shed light on the function and evolution of grammar. To that end, we performed a behavioral experiment in English and Russian and a cross-linguistic computational analysis measuring the redundancy of grammatical cues in transitive clauses extracted from corpus text. English and Russian speakers (n = 484) were presented with subjects, verbs, and objects (in random order and with morphological markings removed) extracted from naturally occurring sentences and were asked to identify which noun is the subject of the action. Accuracy was high in both languages (â¼89% in English, â¼87% in Russian). Next, we trained a neural network machine classifier on a similar task: predicting which nominal in a subject-verb-object triad is the subject. Across 30 languages from eight language families, performance was consistently high: a median accuracy of 87%, comparable to the accuracy observed in the human experiments. The conclusion is that grammatical cues such as word order are necessary to convey subjecthood and objecthood in a minority of naturally occurring transitive clauses; nevertheless, they can (a) provide an important source of redundancy and (b) are crucial for conveying intended meaning that cannot be inferred from the words alone, including descriptions of human interactions, where roles are often reversible (e.g., Ray helped Lu/Lu helped Ray), and expressing non-prototypical meanings (e.g., "The bone chewed the dog.").

17.

Constructed languages are processed by the same brain mechanisms as natural languages.

Malik-Moraleda, Saima; Taliaferro, Maya; Shannon, Steve; Jhingan, Niharika; Swords, Sara; Peterson, David J; Frommer, Paul; Okrand, Marc; Sams, Jessie; Cardwell, Ramsey; Freeman, Cassie; Fedorenko, Evelina.

bioRxiv ; 2023 Jul 28.

Article En | MEDLINE | ID: mdl-37546901

What constitutes a language? Natural languages share some features with other domains: from math, to music, to gesture. However, the brain mechanisms that process linguistic input are highly specialized, showing little or no response to diverse non-linguistic tasks. Here, we examine constructed languages (conlangs) to ask whether they draw on the same neural mechanisms as natural languages, or whether they instead pattern with domains like math and logic. Using individual-subject fMRI analyses, we show that understanding conlangs recruits the same brain areas as natural language comprehension. This result holds for Esperanto (n=19 speakers)- created to resemble natural languages-and fictional conlangs (Klingon (n=10), Na'vi (n=9), High Valyrian (n=3), and Dothraki (n=3)), created to differ from natural languages, and suggests that conlangs and natural languages share critical features and that the notable differences between conlangs and natural language are not consequential for the cognitive and neural mechanisms that they engage.

18.

The language network is not engaged in object categorization.

Benn, Yael; Ivanova, Anna A; Clark, Oliver; Mineroff, Zachary; Seikus, Chloe; Silva, Jack Santos; Varley, Rosemary; Fedorenko, Evelina.

Cereb Cortex ; 33(19): 10380-10400, 2023 09 26.

Article En | MEDLINE | ID: mdl-37557910

The relationship between language and thought is the subject of long-standing debate. One claim states that language facilitates categorization of objects based on a certain feature (e.g. color) through the use of category labels that reduce interference from other, irrelevant features. Therefore, language impairment is expected to affect categorization of items grouped by a single feature (low-dimensional categories, e.g. "Yellow Things") more than categorization of items that share many features (high-dimensional categories, e.g. "Animals"). To test this account, we conducted two behavioral studies with individuals with aphasia and an fMRI experiment with healthy adults. The aphasia studies showed that selective low-dimensional categorization impairment was present in some, but not all, individuals with severe anomia and was not characteristic of aphasia in general. fMRI results revealed little activity in language-responsive brain regions during both low- and high-dimensional categorization; instead, categorization recruited the domain-general multiple-demand network (involved in wide-ranging cognitive tasks). Combined, results demonstrate that the language system is not implicated in object categorization. Instead, selective low-dimensional categorization impairment might be caused by damage to brain regions responsible for cognitive control. Our work adds to the growing evidence of the dissociation between the language system and many cognitive tasks in adults.

Aphasia , Language , Humans , Adult , Brain/diagnostic imaging , Aphasia/diagnostic imaging

19.

Author Correction: Single-neuronal predictions of others' beliefs in humans.

Jamali, Mohsen; Grannan, Benjamin L; Fedorenko, Evelina; Saxe, Rebecca; Báez-Mendoza, Raymundo; Williams, Ziv M.

Nature ; 618(7966): E25, 2023 Jun.

Article En | MEDLINE | ID: mdl-37264080

20.

Erratum: "Causal Contributions of the Domain-General (Multiple Demand) and the Language-Selective Brain Networks to Perceptual and Semantic Challenges in Speech Comprehension".

MacGregor, Lucy J; Gilbert, Rebecca A; Balewski, Zuzanna; Mitchell, Daniel J; Erzinçlioglu, Sharon W; Rodd, Jennifer M; Duncan, John; Fedorenko, Evelina; Davis, Matthew H.

Neurobiol Lang (Camb) ; 4(1): i-ii, 2023.

Article En | MEDLINE | ID: mdl-37378073

[This corrects the article DOI: 10.1162/nol_a_00081.].